Principal component-based weighted indices and a framework to evaluate indices: Results from the Medical Expenditure Panel Survey 1996 to 2011
Authors
Abstract
Indices composed of multiple input variables are built into some data processing and analytical methods. We aim to test the feasibility of creating data-driven indices by aggregating input variables according to principal component analysis (PCA) loadings. To validate the significance of both theory-based and data-driven indices, we propose principles for reviewing innovative indices. We generated weighted indices from the variables obtained in the first years of the two-year panels of the Medical Expenditure Panel Survey initiated between 1996 and 2011. Variables were weighted according to PCA loadings and summed. The statistical significance and residual deviance of each index in predicting mortality in the second years were extracted from the results of discrete-time survival analyses. There were 237,832 individuals surviving the first years of the panels, representing 4.5 billion civilians in the United States, of whom 0.62% (95% CI = 0.58% to 0.66%) died in the second years of the panels. Of all 134,689 weighted indices, 40,803 significantly predicted mortality in the second years with and without adjustment for age, sex and race. The indices significant in both models could at most lead to 10,200 years of academic tenure for individual researchers publishing four indices per year, or 618.2 years of publishing for journals with an annual volume of 66 articles. In conclusion, when information is aggregated based on PCA loadings, a large number of significant innovative indices can be produced, composed of input variables of various predictive powers. To justify the large quantities of innovative indices, we propose a reporting and review framework for novel indices based on the objectives for creating indices, variable weighting, related outcomes and database characteristics. The indices selected by this framework could lead to a new genre of publications focusing on meaningful aggregation of information.
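The weighting step described in the abstract (variables weighted according to PCA loadings and summed) can be illustrated with a minimal sketch. This is not the authors' code. It assumes that the input variables are standardized first, that "loadings" means the unit-length eigenvector coefficients of the correlation matrix, and that one index is built per principal component. The function name `pca_weighted_index` and the synthetic data are hypothetical.

```python
import numpy as np

def pca_weighted_index(X, component=0):
    """Return (index, loadings) for one principal component of X.

    X is an (n_samples, n_variables) array. Assumption: columns are
    standardized before PCA; the abstract does not say whether this was done.
    """
    X = np.asarray(X, dtype=float)
    # Standardize each variable to mean 0 and sample standard deviation 1.
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    # Rows of Vt are unit-length eigenvectors of the correlation matrix,
    # ordered by decreasing explained variance.
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    loadings = Vt[component]
    # Weighted sum: each standardized variable times its loading.
    index = Z @ loadings
    return index, loadings

# Synthetic data (illustrative only): four noisy measurements of one latent trait.
rng = np.random.default_rng(0)
latent = rng.normal(size=200)
X = np.column_stack([latent + rng.normal(scale=0.5, size=200) for _ in range(4)])

index, loadings = pca_weighted_index(X)
```

Each resulting index could then be entered as a predictor in an outcome model, such as the discrete-time survival analyses mentioned in the abstract. The sign of a principal component is arbitrary, so an index and its negation carry the same information.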
Related articles
Analysis of Economic Determinants of Fertility in Iran: A Multilevel Approach
Background During the last three decades, the Total Fertility Rate (TFR) in Iran has fallen considerably; from 6.5 per woman in 1983 to 1.89 in 2010. This paper analyzes the extent to which economic determinants at the micro and macro levels are associated with the number of children in Iranian households. Methods Household data from the 2010 Household Expenditure and Income Survey (HEIS) is ...
Obesity indices among infants and their parents, Shiraz, Iran
Background: Infantile obesity is becoming increasingly recognized as one of the public health problems in Iran. Objective: Obesity charts of a cohort of 317 healthy infants and their parents living in Shiraz (Southern Iran) are presented and the familial pattern of infants’ obesity with that of its parents explored. Methods: An adjusted weight-for-height index was used to develop power type obe...
Evaluation of soil quality in different land uses using multivariate statistical methods
The aim of the study was to investigate the effects of land use on soil quality parameters using multivariate statistical analysis. Soil samples (0-25 and 25-50 cm depths) were taken from three land uses in forest area of Marivan including forest, rangeland, and cultivated land. Soil characteristics of pH, EC, sand, silt, clay and CaCO3 content, water-stable aggregates and their organic carbon ...
Evaluation of drought tolerance in wheat lines
This study was conducted at Research Farm of Isfahan University of Technology to evaluate drought tolerance potential of 23 F2:4 wheat lines derived from the cross of Virmarin (susceptible line) and Sardari (tolerant line). A randomized complete block design with three replications was used in each irrigation treatment (i.e. irrigation after 70±3 and 120±3 mm evaporation from class A pan for ...
Identification of drought-tolerant genotypes in rainfed wheat using drought tolerance indices
Among different environmental stresses, drought is of great importance that induces a highly negative effect on crop production. In order to evaluate drought tolerance in dryland wheat genotypes, 36 genotypes were studied in a randomized complete block design with three replications under rainfed (drought stress) and supplemental irrigation conditions during 2016–2017 growing season in Research...
Journal title:
Volume 12, issue
Pages -
Publication date: 2017